首页> 外文OA文献 >On the Optimality Equation for Average Cost Markov Decision Processes and its Validity for Inventory Control
【2h】

On the Optimality Equation for Average Cost Markov Decision Processes and its Validity for Inventory Control

机译:平均成本马尔可夫决策过程的最优性方程   及其库存控制的有效性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

As is well known, average-cost optimality inequalities imply the existence ofstationary optimal policies for Markov Decision Processes with average costsper unit time, and these inequalities hold under broad natural conditions. Thispaper provides sufficient conditions for the validity of the average-costoptimality equation for an infinite state problem with weakly continuoustransition probabilities and with possibly unbounded one-step costs andnoncompact action sets. These conditions also imply the convergence ofsequences of discounted relative value functions to average-cost relative valuefunctions and the continuity of average-cost relative value functions. As shownin the paper, the classic periodic-review inventory control problem satisfiesthese conditions. Therefore, the optimality inequality holds in the form of anequality with a continuous average-cost relative value function for thisproblem. In addition, the $K$-convexity of discounted relative value functionsand their convergence to average-cost relative value functions, when thediscount factor increases to 1, imply the $K$-convexity of average-costrelative value functions. This implies that average-cost optimal $(s,S)$policies for the inventory control problem can be derived from the average-costoptimality equation.
机译:众所周知,平均成本最优不等式意味着存在马尔科夫决策过程的平稳最优策略,其平均时间为每单位时间,并且这些不等式在广泛的自然条件下成立。本文为具有弱连续转移概率,可能具有无穷的单步成本和非紧凑动作集的无限状态问题的平均成本最优方程的有效性提供了充分条件。这些条件还意味着折现后的相对价值函数与平均成本相对价值函数的序列收敛,以及平均成本相对价值函数的连续性。如本文所示,经典的定期审查库存控制问题可以满足这些条件。因此,最优不等式以不等式的形式存在,对此问题具有连续的平均成本相对值函数。另外,当折现系数增加到1时,折现相对价值函数的$ K $凸性和它们对平均成本相对价值函数的收敛性,意味着平均成本相对值函数的$ K $凸性。这意味着可以从平均成本最优性方程得出库存控制问题的平均成本最优$(s,S)$策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号